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Abstract 



We have exactly solved the relaxational dynamics of a model protein which 
possesses a kinetically perfect funnel-like energy landscape. We find that the 
dependence of the relaxation time, r, on the density of states (DOS) and the 
energy level spacing distributions of the model displays several main types of 
behavior depending on the temperature T. This allows us to identify possible 
generic features of the relaxation. For some ranges of T, r is insensitive to 
the density of states; for intermediate values of T it depends on the energy 
level spacing distribution rather than on the DOS directly, and it becomes 
gradually more dependent on DOS with increasing temperature; finally, the 
relaxation can also be determined exclusively by the presence of a deep gap 
in the energy spectrum rather than by the detailed features of the density of 
states. We found that the behavior of r crucially depends on the degeneracy 
of the energy spectrum. For the special case of exponentially increasing de- 
generacy, we were able to identify a characteristic temperature which roughly 
separates the relaxational regimes controlled by energetics and by entropy, 
respectively. Finally, the validity of our theory is discussed when roughness 



of energy landscape is added. 
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I. INTRODUCTION 



It is well known that natural proteins fold into their native structures remarkably quickly 
in times on the order of a second in spite of the enormous number of possible physical con- 
figurations On the other hand, it is also clear that heteropolymers with completely 
random monomer- monomer interactions usually do not fold on a reasonable time scale ||. 
One explanation put forward to resolve this discrepancy is that protein sequences are "op- 
timized" such that not only is there a stable unique structure for the ground state ||, but 
there is also a funnel-like energy landscape which leads to efficient folding kinetics • A 
principle of minimal frustration was proposed || to enforce a selection of the interactions 
between monomers such that as few energetic conflicts occur as possible. Among other 
things, considerable theoretical effort has concentrated on finding proper models for protein 
folding and investigating various sequencings which lead to fast folding kinetics. Due to 
the immense complexity of the problem, much of our understanding and intuition has been 
obtained from a variety of computer simulations based on lattice models |9||lCj . 



In this paper we concentrate on the folding scenario involving a funnel-like energy land- 
scape where the funnel "guides" the protein into the low energy native structure. Along 
the pathway, the protein is believed to go through several distinct states including the molten 
globule state, a folding transition region, and a glass transition region. Even though the 
funnel landscape possesses a certain amount of roughness which slows the folding kinetics, 
the folding process is largely speaking guided by the global funnel structure and the protein 
can in this way rapidly find its native state. Although there is no clear experimental evi- 
dence of the existence of this folding scenario, it is nevertheless theoretically interesting and 
attracted much attention in the literature. In general, the folding kinetics for funnel-like 
energy landscapes is very complicated and analytical studies have proved to be quite diffi- 
cult. In this regard, an interesting study is due to Zwanzig J7J where some of the general 
properties of the folding kinetics were examined using an extremely simple model. 

Even though there are many folding models for specific proteins, we believe that it is 
intuitively useful to investigate the generic behavior of the folding kinetics. This, in some 
sense, is similar to finding universality classes in critical phenomena. The purpose of this 
paper is to report our studies in this direction. In particular we examine a simple statistical 
mechanical model which mimics all the basic properties of a perfect funnel-like landscape in 
the absence of roughness. We also discuss the validity of our results for the case of involving 
small amount of roughness (see below). The landscape itself consists of a set of energy levels 
forming a quasi-continuous spectrum with a single level lying far below this spectrum. All 
the levels represent conformational energies of the protein with the lowest level representing 
the native state. This model is quite general and is not exclusive to proteins. It could for 
example represent certain classes of polymers. 

There are many interesting questions concerning protein folding kinetics which we would 
like to answer from an analysis of our model. For instance, for a protein sequence which 
folds rapidly to its native state, what is the role played by the energy spectrum along the 
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folding pathway ? What is the role of energy level spacing statistics on the folding kinetics 
? How does the relaxation process of our system depend on parameters such as temperature 
? These are interesting and difficult general questions which are relevant to the folding 
kinetics, since the protein passes through the energy spectrum during the folding process. 
An analytical answer to these questions for a general protein problem has so far been not 
possible. However as we show below for our model which is a generalization of that studied 
in Ref. 0, analytical solutions can be found when the funnel structure has no roughness 
and reasonable approximations could be made to find an answer when small roughness is 
included. Our perspective is that exact solutions are valuable since they can be used as 
a starting point for further more complicated models, similar to our experiences in critical 
phenomena and phase transitions. 

To this purpose, we have derived analytical expressions for our model (see below) which 
show that the energy spectrum and the energy spacing statistics can play an important 
role in folding kinetics. In particular the dependence of the folding time, r, on the energy 
level distribution of the various models can be classified into three main types of behavior 
depending on the temperature T. For a considerable range of T, r is insensitive to the 
level distribution; for intermediate values of T it depends directly on the distribution and 
becomes gradually more model dependent; finally, in the third case the folding kinetics is 
determined exclusively by the presence of the deep gap in the energy spectrum rather than 
by the details of the energy level distribution. We found that the behavior of r crucially 
depends on the degeneracy of the energy spectrum. For the special case of exponentially 
increasing degeneracy, we were able to identify a characteristic temperature which roughly 
separates the relaxational regimes controlled by energetics and by entropy respectively. Our 
general formula for this simple model is consistent with existing literature in the appropriate 
limits and we present numerical solutions to confirm the physical picture indicated by our 
analytical results. 

The paper is organized in the following manner. A general expression for the relaxation 
time is derived in the next section. Sections III and IV presents results for the applications 
of this expression. Finally a short summary is presented in the last section. 

II. RELAXATION KINETICS 

By analogy with Ref. |]], we focus on a perfect funnel- like energy landscape defined 
by an abstract "reaction coordinate" X. For example X could represent a specific protein 
structure which has energy E(X). We emphasis again that whether or not a real protein 
possesses funnel-like energy landscape is unclear, but we shall examine the consequences 
of this landscape. A perfect funnel with no roughness is schematically shown in Fig. ([[]). 
Clearly this is a considerable simplification of the problem, but it allows us to investigate 
the relaxational kinetics completely analytically. As discussed below, other features can be 
systematically added on later, such as a small amount of roughness. 
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Thus, as the system relaxes or the "protein" folds, it rolls down the funnel, E(X), to the 
final native structure characterized by energy E a . Now a particular model can be described in 
terms of its energy level distribution or its density of states. Here we consider the situation 
where there are N quasi-continuous energy levels with density of states, D(E), and one 
distant level, E Q , lying at a distance AE Q below the quasi- continuous spectrum (see Fig. 
(0)). During the folding process (relaxation process) we assume [|7| that in any transition 
between configurations, X, changes only by ±1 which means that the system performs a 
nearest neighbor random walk in one-dimensional reaction coordinate space. For the perfect 
funnel energy landscape considered here, X — > X ± 1 also implies that energy transitions 
only occur between nearest neighboring levels. In this work we use Metropolis transition 
rates, same as that used in Ref. [^], which is justified since no roughness is included in the 
model. On the other hand if the landscape is very rough indicating entanglement of the 
polymer or protein, Metropolis rates will not be adequate. The Metropolis rates satisfies 
the requirement of detailed balance 



W(X — X + l)Px(eq) = W(X + 1 -> X)P x+l {eq) 



(1) 



where Px{eq) is the equilibrium distribution given by a Boltzmann factor. Using gx and 
gx+i as the degeneracies of the X and X + 1 energy levels, one can then introduce the 



following transition rates H^(X — > X + 1) 



9x+i 

ax 



exp 



-(E x+ ,-E x ) w ^ + ! _> = 1. 



The second condition, which is independent of temperature T, corresponds to the zero- 
roughness on the E(X) landscape. 

The folding or relaxation kinetics is studied using a master equation. We focus on 
the probability, Pi (i = 0, 1, 2, ...N), of being at energy level Ei during the relaxation. 
Introducing variables = exp — 4^ where AF, = F i+ i — Fi i G {0, iV — 1}, and F{ = Ei — 
T In gi we can write down the following matrix equation for the evolution of the probabilities 



dP 



AP 



(2) 



where the matrix coefficient is given by 
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(3) 



Because the total probability is a constant, the matrix A is thus degenerate and there 
are only N independent probabilities out of a total of N + 1. We denote the average 
nearest neighbor energy level spacing by U. We then make the reasonable assumption that 
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U <C AE Q which is basically a consequence of having a spectrum with a few low lying energy 
levels and a quasicontinuous part in the upper part of a spectrum. Several limits can be 
obtained directly from the form of the matrix A. For < T ^ U and U T AE , A 
essentially becomes a constant matrix largely independent of temperature except with only a 
few temperature dependent terms and thus the folding kinetics is independent of the energy 
spectrum, D(E). The same is true for very high temperatures where entropy is the dominate 
factor. At temperatures T ~ AE Q ^> U, almost all matrix elements of A become constants 
except those few involving energy differences comparable to the AE a . This suggests that 
in this case the folding kinetics is exclusively determined by the presence of the gap in the 
energy spectrum while almost all quasi-continuous levels are already excited. 

A non-trivial result was obtained for the temperature range T ~ U. Here the kinetics can 
substantially depend on the energy distribution of the quasi- continuous part of the spectrum. 
We first find a relaxation time at temperatures small enough that one can disregard the rate 
of escape from the native state. In this range the master equation of @ becomes dP Q / dt = Pi 
and 

dP' 

W = M.I». (4) 

where P' is a vector (Pi, P2, P/v). The matrix M is a sub-matrix of A without its first row 
and first column. As the relaxation proceeds, i.e. when our system rolls down the perfect 
funnel E(X), the total relaxation time from the highest energy level E^ to the lowest one 
E , gives a measure of the relaxation time. To determine this relaxation time, t re i, we notice 
that the system has relaxed when all states with P z , i 6 {1,N} have been sequentially 
relaxed. Therefore t re i ~ J2f=i U where t% are the relaxation times for the states with Pj. 

It is worth pointing out that at low temperatures the relaxation time of a system coin- 
cides with its folding time into the ground (native) state because the native state at these 
temperatures is an equilibrium state of the system. As temperature increases, however, the 
equilibrium state of the system shifts to the quasicontinuous part of the spectrum. Hence 
t re i will characterize the folding time to the appropriate equilibrium. Keeping this in mind, 
we now calculate t re [. From Eq. (f|) we have Y,iLi U = ~ J2iLi x~ where A« are the eigenvalues 
of the matrix M. As det(M) 7^ 0, j- are the eigenvalues of matrix M~ l . In this way we 
finally obtain t re \ in terms of the trace of M~ l since t re \ ~ — Tr(M _1 ). 

The calculation of Tr(M~ l ) is quite lengthy |TT[ and we outline the main steps here. 



We notice that M = L + 5L where L is a constant matrix and SL contains the temperature 
dependent elements of M. We seek M~ l in a perturbative form given by M~ l = Y^t^o 
It turns out that this sum only has a finite number of non-vanishing terms and can thus be 
summed exactly. Essentially, from M ■ M^ 1 = I one derives a set of equations for which 
can be solved to give = (—iy(S°5LyS°, where S° is a triangular matrix with all the 
upper right elements equal to —1. Because of the simplicity of SL and S°, (S°6L) 1 = for 
all i > N. After lengthy but straightforward algebra we obtain 
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(5) 



r = -Tr(M 1 ) = N + ai + a 2 + ■ ■ ■ + a\a 2 + a 2 a 3 + • • • + aia 2 a 3 + 



(6) 



We now solve the problem in general for all T. To find the relaxation time r of this 
perfect-funnel model, we use the fact that the total probability is conserved, i.e. that 



N 



(7) 



Then equation (2) is equivalent to the following equation, 

dP' 

a 5 ifi + (M - a 8M) ■ P' . 



dt 



(8) 



Thus the kinetics will basically be determined by the matrix M — olq5M where 5M is a 
matrix with a first row consisted of 1 and the rest of the elements equal zero. As discussed 
above, the relaxation time of the system can be found as the trace of (M — aoSM)^ 1 . To 
compute this trace we use the perturbation expansion introduced above and seek an inverse 
matrix in the following form 



(M - a 5M) 



-i 



H(o) + H(d + H( 2 ) + ■ 



(2) 



(9) 



Using the identity 



I = (M - a 5M)(H {0) + H {1) + H { 



(2) 



we find a set of equations for which can be solved to give 
where M~ l is found earlier and is given by Eq. (B). Hence, 



(M - a Q 5M)- 1 = M-\l + aoiSMM' 1 ) + a 2 {5MM- 



1\2 



+ 



Introducing a new quality 

Ri = 1 + ai + otiOti+i + aia i+ ia i+3 + ■ • ■ + a { ■ ■ ■ a N 
one can easily show that the required inverse matrix can be written as 



(10) 



(M - a^M)- 1 = M~ l + aoM^SMM^il - (a Ri) + (a #i) 2 



(« 0j Ri) 3 
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This is an important result and the trace of this matrix gives the relaxation time of our 
model. 

Formally the sum in the bracket of Eq. fllTD is a geometric series, which we can rewrite 
in a more compact form, 

(M - a^M)- 1 = M~ l + ^—M-^MM- 1 . (12) 

1 + otoRi 

Strictly speaking the summation of the series is allowed only when (ao-Ri) < 1. Thus in 
principle we should use Eq. (O) when this condition is not satisfied. It turns out, by 
numerical comparisons, that Eq. ( |T2"D is correct even for (ao-Ri) > 1- Even though this 
indicates that there is most probably a more direct way of deriving Eq. ( ]T2"| ) rather than 
via the series expansion used here, we use flTjp to proceed further and present numerical 
confirmation of this procedure later. 
The relaxation time is then given by 

r = -Tr(M^) < ^—Tr(M- 1 5MM- 1 ) . (13) 

1 + OLqR\ 

Using the explicit forms of the matrices as given by Eq. (fj|) and SM, after lengthy but 
straightforward algebra we obtain, 

N-l N-i 

Tr(M~ l ) = N+Y,T, a r -<Xj+i-i > ( 14 ) 

i=l j=l 

and 

Tr(M~ 1 5MM^ 1 ) = R x + {R x + R 2 ) ai + {R x + R 2 + R 3 ) ai a 2 + ■■■ 

+ (Rx + ■ ■ ■ + Rif)ax • -QJjv-i • (15) 

If we define Zi as Zi = Y^f=i ex P is then the partition function for the N — i 

energy levels starting at % and Ri is given by Ri = Ziexp -p-. The above results can then be 
considerably simplified and the expression for r becomes 

r = -Tr{M- x ) - U=1 1 P T . (16) 

Zo 

Using Eqs. (14) to (16) we arrive at the main result of this work, 

T = T)x+T)2 (17) 

where 

JV-l N-i 

vi = (n+ EE«i' •«*+*-!) ( 18 ) 

i=l j=l 

r? 2 = = ■ (19) 

Because this result is quite complicated, we first apply it in the next section to various 
specific situations, and then present numerical results obtained using the general expression 
of Eqs. (13) to (|TJ). 
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III. THE ROLE OF LEVEL STATISTICS 



As a first application of the result given by Eq. (0), we examined the case where 
the energy levels are non-degenerate, i.e. gx — 1 for all X, then local free energy Fi 
coincides with the energy E^. In this limit the expression for the relaxation time can be 
greatly simplified. Also, we shall focus on low temperatures. In this case we shall prove 
that the relaxation time r is determined by the energy level spacing distributions. These 
distributions can be computed from the density of states D(E). Since very different models, 
i.e. different D(i?)'s, can give quite similar spacing distributions, the relaxation in this 
temperature regime is quite generic. In the next section we shall show that even when 
gx > 1, similar conclusions can be reached if temperature is lower than some characteristic 
temperature. 



A. The relaxation time 

For temperatures much smaller than the energy gap between the ground state and the 
first excited state, AE Q , and taking into account that there are no exponentially divergent 
pieces in the second term 7/2 of Eq. (|TTD, we can safely neglect 772 since it is much smaller 
than the r/i term. Furthermore we notice that 

a i a i+a ..a i+i _i = ^ p _ & j+1 -E j +E j+2 -E j + 1 +...+E j+i -E d+i ^ 

= exp- ^±p^ 

The relaxation time r can therefore be expressed as follows in terms of the level spacing 
probability 

N-i 

PM(S) = '£6{E j+i -E j -S) 
3=1 

which measures the probability that Ej + i — Ej equals S for all level indices j. Using this 
definition we find: 



N-i 



«j • -tti-M-i = / P(i){S)exp--dS . 
3=1 J 

From Eqs. (0) and (|18D we obtain 

N- 1 r +oo Q 

r = t rel ^N+J2 / P(i)(S)exp--dS . (21) 

i=l Jo 1 

It is clear that small values of the level spacing, S, give the largest contribution to the 
integrals in (|2l| ) and this is especially true for low temperatures. On the other hand for 
higher order energy level spacing distributions, p^(S) is small for small values of S. Hence, 



9 



for low T the term containing the nearest neighbor spacing distribution, p^(S), is most 
important in determining the relaxation time r. As T increases r begins to depend on 
higher order energy level spacing distributions. Since more and more p^(S) begin to play a 
role as T increases, we expect the model details to become increasingly important. However 
it is easy to show that p^) (S) for large i must be universal: as the levels are far apart there is 
little level correlation and the spacing distribution therefore approaches a Gaussian. Hence 
we expect model independence to return when T reaches values ^ U. Finally, if P(i)(S) is 
less sensitive to model peculiarities than the density of states, D(E), we expect that the 
relaxation kinetics measured by the relaxation time r is approximately generic as it only 
depends on several low order spacing distributions in this low temperature range. We shall 
confirm this picture by computing P(i)(S) and in particular p^(S) in terms of D(E). 



B. The spacing distribution 



To relate (S) to the density of states, D(E), which specifies our models, let us take any 
two energy levels and consider the probability that the first level lies in the interval [E, E + 
dE] while the second level lies in [E + S, E + S + dS]. To find p^ (S) we use an approximate 
approach [12,01 analogous to mean field theory in which the energy distribution, D(E), is 



assumed to be locally random. This allows us to use a simple approach based on probability 
theory. First we note that there are % — \ levels in the interval [E,E + S] and the remaining 



N — i levels are outside this interval. The probability for this to occur is proportional to fll 
D(E)D(E + S) 1-1 D(t)dt I D(t)dt <ISdE 



rE+S 


(N-i) 


' rE+S 


1 - / D(t)dt 




/ D(t)dt 


JE 




JE 



(22) 



Integrating over E and normalizing the resulting expression, we obtain 



P(i)(S) 



rE+S 


(N-i) 


' rE+S 


1 - / D(t)dt 




/ D(t)dt 


JE 




JE 



l(i-l) 



dE (23) 



where C is the normalization factor. In spite of its involved appearance, this equation is 
easy to investigate. For example consider p^)(S), which is written 



/+oo rE+S 
D(E)D(E + S) 1- / D(t)dt 
-oo JE 



(N-l) 



dE 



(24) 



l-!l +s D{t)dt 



to the factor 



scale of 4j. If S 1, in the N 



(N-l) 



It is easy to see that if D(E) is substantially larger than zero on an interval £, then due 

, p(i)(5) will also be substantially larger than zero on a 
oo limit we can expand the integrand to obtain 



P(1) (S) = N(N + 1) 



[ +0 ° D l+2 (E)dE 



(25) 
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From Eq.(|25|) it can also be deduced that the scale on which P(i)(S) is substantially greater 
than zero is 0(-^). Furthermore, it is clear from Eqs. fl23|) and (|25| ) that P(i)(S) is determined 
by the values of a set of definite integrals of the density of states distribution, D(E), and its 
powers rather than the specific details of the level distribution itself. 

How sensitively doesp(i)(S') depend on D(E) ? Consider two completely different models 
specified by DUE) = exp{-E) with E e [0, +00], and D 2 (E) = -MkE d with E G [0, E ] and 
d > 0. Using Eq. we can explicitly compute the nearest neighbor spacing distribution 
for the two models. It is easy to show that both models give the same form of P(i)(S) 
for large values of the parameter d. Furthermore, even for d ~ 0(1) the difference is only 
~ (d+l)/d. The fact that the two chosen forms of D(E) are considerably different from one 
another shows that different models described by different forms of D(E) can have similar 
nearest energy level spacing distributions. Hence they can have similar relaxation times as 
specified by Eq. (O) in the appropriate temperature range. 



IV. A MORE GENERAL CASE 

In the last section we considered the case in the absence of level degeneracy. In that case 
the relaxation time can be expressed in terms of the level spacing distributions. However 
when some of the energy levels are degenerate, we were in general not able to write Eq. ( |TT|) 
in a simple form like Eq. fl2lD in terms of the level spacing probability, p^(S). In addition if 
the degeneracy gx of the energy levels rises exponentially as the number of the energy level 
increases we can not neglect the 772 term even if the temperature is much smaller than AE Q . 
In this more general case, the level degeneracies and the energetics will compete to control 
the relaxation. Thus we expect the relaxation time r to have a non-monotonic behavior as 
the temperature is changed. In particular a characteristic temperature Tf is found which 
roughly separates the relaxation regime which is controlled by energetics and the regime 
controlled by entropy. 



A. The characteristic temperature 



We now examine the special case of level degeneracy which is approximately realized in 
protein spectra where gx ~ 7 X and 7 is a constant greater than unity in this case local 
free energy F t equals Fj 

/ N-i „ +00 
r = 771+772 = / p^^exp % In 7 



Ei — i In 7. In this case it is easy to show, using Eq. (|T7|) , that 



s 

T 



dS 



Z 



It is easy to show that the 7/2 term as a whole scales as expiV(ln7 — ^ — Jk) for temper- 
atures T > St., and the temperature at which this term becomes order unity is given by 



u ^ N 



In 7 ' 



which is comparable with the average level spacing and becomes independent of 
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the ground state energy E Q as N approaches infinity. For T > Tf, \rj2\ is a rapidly increasing 
function of N and at T ~ Tf the two terms 771 and 772 become comparable. 

Our results show that the relaxation time of the model involves competition between the 
two terms 771 and r/ 2 . The fact that \r]i \ becomes comparable with \r] 2 \ at Tf implies a change 
of behavior as temperature T is swept across Tf. To gain further intuition concerning the 
behavior of the system at T = Tf, we examined the limit for which all the N levels in the 
quasi-continuous part of the spectrum are equidistant that U is the energy difference between 
any two adjacent levels. Again let AE be the distance between the native state and the 
lowest energy of the quasi-continuous part of the spectrum. The equilibrium probabilities 
of each level will then become 



where K = exp(hi7 — 5?) = exp(hi7(l — Here we have defined Tf — Because 

N is large, this expression for K together with the expressions for Pq and Pi above shows 
that for T < Tf the population of the energy levels will be given by P ~ 1 and Pj <C Pq 
for all % G {1,N}. The equilibrium state of the system is then the ground state of our 
spectrum. For T > Tf, on the other hand, Pj <C Pj + i for all i G {0, iV — 1}, so that the 
equilibrium state will be shifted to the upper part of the quasi-continuous spectrum. Hence 
for T < Tf or T > Tf the equilibrium state of the system is relatively well defined. In the 
case when T ~ Tf all the levels in the spectrum become almost equally probable and hence 
large fluctuations can be expected. ^From the kinetic point of view, this means that at 
T ~ Tf the fluctuations lead to slow relaxation and thus large values of r. In this sense this 
behavior is similar to the well known "critical slowing down" found for critical phenomena. 
From this argument it follows that for T ~ Tf one should expect occurrence of a maximum 
in the relaxation time of a system. Finally we note that the characteristic temperature can 
also be obtained by the convergence criterion for the series of Eq. (|TT|) , namely «oPi = 1- 

We may conclude from our discussions that as soon as our system has energy levels 
with increasing degeneracies, there exists a temperature scale Tf which is determined by 
the average level spacing of the spectrum and the degeneracy parameter 7. Close to Tf the 
tendency of increasing entropy overcomes the tendency of relaxing into a state with lowest 
energy, and this leads to a sharp increase in the relaxation time of our system (see below 
for numerical calculations). Thus we expect a change in the behavior of the relaxation 
as temperature is varied by crossing Tf. In this sense, the characteristic temperature Tf 
can be regarded as the "folding temperature". This is indeed what observed in Ref. |7| on 
situations similar to that discussed in the last paragraph. Numerical results for this case 
will be presented in the next subsection. 








K N -1 
K-l 



and 



Pi 




K N -1 
K-l 
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For T <C Tf, we can again neglect the 772 term, and the relaxation time can be expressed 
as follows 



T 



"rel 



N-l ,+oo 

N + E I P0)(S)exp 



i In 7 — 



dS 



(26) 



Similar to the discussion in the last section, it is clear that as the temperature increases 
from zero to Tf, r becomes increasingly dependent on the higher order energy level spacing 
distributions. However when T ~ 0, r is only determined by the nearest neighbor spacing 
distribution p^(S). Thus the relaxation at very low temperatures is still generic in the 
sense of the discussion of last section, namely r is insensitive to the density of states D(E). 

For proteins in general, it is reasonable to assume that the number of levels is large, 
i.e. that iV — > 00. Then the characteristic temperature Tf is of the order of In order 
to examine higher temperatures, we first consider T > Tf and the 7/1 term as given by 
Eq. fl2~6p . In this case the exponential in the integrand of Eq. (|26| ) is larger for larger 
values of the summation index i. Therefore for T > Tf, only higher order energy level 
spacing distributions P(i)(S) play a substantial role. However, as mentioned before, P(i){S) 
approaches a Gaussian for large i and hence the r\\ part of the relaxation time again becomes 
generic for T > Tf. In addition, it is easy to see that the leading contribution in the 7/2 
term is due to the difference in energies between the highest and the lowest energy levels for 
T > Tf and is of the order of exp N^j^. Since this also defines quite general properties of a 
given model, we expect weak model dependence for the 772 term as well. Hence we conclude 
that above the characteristic temperature, the relaxation time is only weakly dependent on 
the detailed features of a given model. 

The major model dependence is expected in the range < T < Tf, while Tf itself is 
determined by the degeneracy parameter 7 and average level spacing. The striking feature 
of the current case is that as T reaches Tf there is an increase in the dependence of r on the 
higher order energy level spacing distribution: at very low T it is p(i)(S) which determines 
r, while above Tf it is the higher p^(S) which is responsible. The important conclusion is 
that it is possible for different models which correspond to different density of states D(E), 
to have similar spacing distributions p^(S). If this is the case the folding time is weakly 
sensitive to the details of a given model in the lower temperature range T <ti U for the 
non-degenerate models and, and T <^Tf for the degenerate models. 



B. Numerical results 

Although we have obtained all our results analytically, it is useful to obtain some nu- 
merical data as this gives considerable intuition about the relaxation kinetics of the model 
studied here. For this purpose, we employ the model of Ref. j7|. In particular we assume 
that it has iV energy levels with equal nearest-neighbor spacings U in its quasi-continuous 
part and AE Q below is the ground state. We assume AE Q ^> U . The degeneracy of the 
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quasi-continuous part is given by 7 i_1 where % is the index of energy level. In the calculations 
we used N = 100, U = 1, and the energy gap between the ground state and the bottom of 
the quasi- continuous part of the spectrum AE Q = 12. 

Then from our general result given by Eq. flT7|), the relaxation time can easily be calcu- 
lated and we obtain, for the whole temperature range, 

T = N —- K [—)-^\-l<^) K TT^W (27) 

Here K = 7exp— ^ and the prime means differentiation with respect to K. A simpler 
expression can be obtained when the limit iV —>■ oo is taken. 

The relaxation time r as a function of temperature for various degeneracy parameters 7 
is shown in Fig. (0). We computed r in two ways: either from a direct numerical inversion of 
the matrix (M — a^dM) of Eq. (|]) and then finding its trace, or by using the analytical form 
of Eq. (|17D . Fig. @) shows that these two methods give exactly the same results throughout 
the whole temperature range, justifying the mathematical procedure which led to Eq. (|P7|). 
Several observations are in order. First, when the energy levels are non-degenerate, i.e. 
when 7 = 1, there are no entropic effects in the model and the system simply roles down 
the perfect funnel landscape in the relaxation process. In this case there is no characteristic 
temperature Tf and r is completely determined by the energy level spacing distribution, 
as discussed before. Secondly, for cases with increasing level degeneracies, i.e. for 7 > 1, 
the relaxation time shows the expected maximum. Also the position of the maximum is 
exactly at the characteristic temperature Tf (see below). The behavior is consistent with 
that reported in Ref. 0. Finally, the "transition" at Tf becomes sharper as 7 is increased. 
This is expected as it is similar to the situation that occurs in a finite system where a thermal 
phase "transition" becomes sharper when the degree of freedom is increased. 

In Fig. (H) the characteristic temperature as obtained from Tf rs + is shown as 
a function of the degeneracy parameter 7. From this expression Tf decreases monotonically 
as 7 is increased which must be true because higher degeneracies of energy spectrum leads to 
higher entropies involved. The data points in this figure were taken from the peak positions 
of Fig. (0) and are in good agreement with the theoretical definition. Hence we conclude 
that t takes maximum values at the characteristic temperature Tf. 



C. A discussion on the effect of roughness 

So far our analysis is rigorous when the energy landscape is a perfect funnel in the absence 
of roughness. Including arbitrary roughness will make the problem essentially unsolvable 
analytically. However under the assumption that the roughness is small, our analysis can 
be extended to estimate the effects of it. In this section we will not attempt a rigorous 
treatment of the influence of roughness on the polymer dynamics. Rather, we will specify 
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in what way the results of our theory will be modified if roughness is included. Our analysis 
follows the work of Leite and Onuchic ITH] . 

The energy landscape roughness can be modeled [|1(J by a distribution of states at a given 
value of the reaction coordinate X. The roughness is considered small if the width of the 
distribution is smaller than the average energy level spacing in the spectrum. Then, each 
energy level E(X) considered so far can be thought as being "smeared" out by an energy 
probability distribution g(X,E) = - — SE * x ^i ex P ~ ^se(x^ • Here SE(X) characterize the 
"strength" of the roughness or the width of the energy band corresponding to a particular 



value of X. For this rougness model, following Ref. [16], a useful concept which arises is 



a coordinate-dependent phase transition ||16|| . After the introduction of small roughness, 
the narrow band of states within SE(X) can be considered to be a subsystem with its own 
dynamics. This consideration predicts 0] that for an energy band with coordinate X there 
is a critical temperature TJX) = 8E{X) -i where Q(X) is a number of conformations 

corresponding to the the level at X. If T < TJX) for a particular X, the band at E(X) 
will behave in such a way that the dynamics inside this band is glass-like. This means that 
the system will tend to be frozen in a few low lying states of this band while relaxing inside 
it. This effect will have an important influence on the relaxational dynamics. In the case of 

protein folding we model £l(X) ~ 7 X , and therefore TJX) = SE ( X *> 1 where X = 0, 1, 2, .... 

(2x1117)2 

Although 5E may depend explicitely on X, let us first consider the limiting case when 
5E is a constant over the spectrum. In this case for system temperature T < — — — r 

(2Xln 7 )2 

there are If = (^fr) 2 ^ - low lying energy bands with "glass"-like dynamics. The lower the 
temperature, the more "frozen" bands would be in the system. As the global dynamics 
involves total number of energy states which is proportional to J2f Y = 7 _ x , a "global" 
phase transition temperature can be estimated, following Ref. [pjl, as Tf ~ — — — r where N 

is the number of energy bands (N = max{X}). It is important to compare this temperature 
scale with the folding temperature discussed previously, Tf ~ The number of "frozen" 
levels at the folding temperature is If ~ (jj-) 21 -^- — TfN^/Tf. This number can serve as a 
criterion for the definition of roughness "strength" . One can easily see that if the roughness 
SE < U, then If ~ 1. We thus conclude that only a few low lying energy bands with X ~ 1 
are frozen at Tf. In this case the results derived from a perfect funnel should be applicable 
for temperatures T ~ Tf, and the relaxation kinetics can be slowed down in the lower part 
of a spectrum near the ground state. 

The above discussion is for a constant SE and more realistically there is X dependence 
of this quantity. For protein models it is natural to use the measure of compactness, such 
as the total number of nearest contacts, to specify the energy states. If Cl is the maximum 
number of contacts for a polymer with L monomers, then we may define Cl — X to give the 
total number of contacts at energy E(X). It is reasonable to assume that for a structure 
with larger number of contacts, more interaction parameters are involved in computing the 
energy. In general for a given distribution of these parameters, the energy band width SE 
is thus larger. For this reason we expect a decreasing SE(X) as X is increased (although 
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the level degeneracy 7 X is increasing). Hence the validity of the perfect funnel results can 
be assessed by using the largest 8E(X) appropriate to the lower part of the spectrum and 
the discussion of the last paragraph. 



V. SUMMARY 

This work was motivated by a particular protein folding scenario based on a funnel-like 
energy landscape. We studied the relaxational behavior when a system possesses a perfectly 
smooth energy funnel. Our model is specified by a density of states D(E) for the quasi- 
continuous part, and a very low ground state level E Q . The levels of the quasi-continuous 
part may have exponentially increasing degeneracies charactered by a degeneracy parameter 
7. Because the funnel is smooth, the relaxation is very simple and can be obtained by 
using Metropolis rates. Indeed, the problem has been solved in closed form for the whole 
temperature range in the aabsence of roughness. 

We found that the dependence of the relaxation time, r, on the energy level and the 
level spacing distributions of the models displays three main types of behavior depending 
on the temperature T. In the case where the energy levels are non-degenerate, a general 
formula can be obtained relating r to the level spacing distribution. Because r is largely 
speaking determined by the nearest level spacing distribution, which we have shown to be 
only weakly dependent on D(E), we conclude that the relaxation behavior can be said to 
exhibit "universal" features. In the degenerate case which is more realistic for protein models, 
a characteristic temperature Tf is found which separates the relaxation regimes dominated 
by energetics or by entropic effects. Hence in this sense Tf can be thought of as a "folding 
temperature" for the model studied here. We found that r is weakly dependent on the 
specific density of states of a given model if T <C Tf while becoming more and more model 
sensitive as T approaches Tf. Again, at T > Tf the system becomes almost insensitive to 
the detailed features of D(E). Across Tf the relaxation time shows a maximum, indicating 
the "folding transition", in agreement with the earlier model of Zwanzig [|7|. A possible 
experimental study of the relationship between the energy level distribution and folding 
kinetics as examined by our model would involve a systematic study of several proteins using 
a combination of thermodynamic techniques such as calorimetry Jl5| and folding assays. 

The discussion presented here is only valid for a perfect funnel-like energy landscape, 
i.e. that of a "good" protein sequence. The connection to a particular model is through 
the density of states D(E) which can always be obtained numerically. If small amount 
of roughness is added via a finite width of the energy state distribution for a particular 
reaction coordinate X, the validity of the perfect funnel results can be examined following 



the approach of the recently published work of Ref. In general, our theory correctly 

describe the kinetics of the model system including roughness in a tempearture range T > 
N^T*?: below this temperature kinetics slows down by the roughness and approach glassy- 
like dynamics at T ~ T c 9 . As discussed in the last section, Tg is small for small amount of 
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roughness (small SE). 

Finally we comment that while our work was motivated by the protein folding problem, 
the model is however only specified by the density of states and relaxation in a perfect funnel 
with a low lying ground state. Thus the formula derived here are applicable to any other 
situations where a similar arrangement applies. On the other hand, as far as protein folding 
is concerned, the model studied here possesses many features of more realistic models. 
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FIGURES 

FIG. 1. A schematic plot of the perfect funnel-like energy landscape in reaction coordinate 
space. The energy spectrum has a low lying ground state with energy E a , and a quasi-continuum 
part which is specified by the levels Ei with i € [1, N\. 

FIG. 2. The logarithm of the relaxation time r as a function of temperature T for various 
degeneracy parameters 7 as indicated by the numbers near the curves. The unit of T is the 
level spacing U, and arbitrary unit is set for r. The solid squares were computed from a direct 
numerical inversion of the matrix (M — oiq5M) of Eq. (^) followed by a calculation of the trace 
of (M - a Q 5M)- 1 . The solid lines were calculated using the analytical form of Eq. (|i~7|). Clearly 
these two methods give exactly the same answers throughout the whole temperature range. 

FIG. 3. The characteristic temperature Tf as a function of the level degeneracy parameter 
7. Unit of Tf is the level spacing U. The solid line is obtained from Tf w + N ^ ■ The solid 
squares in this figure correspond to the temperatures at which the relaxation time achieves its 
maximum, i.e. the peak positions of Fig. @). These two prescriptions give nearly the same values 
for Tf. 
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Figure (2) 




Temperature 



TempleGraph 2.4, Origin: /morgoth/users/guo@celeborn - Thu Aug 22 1 1:47:27 1996 (data file was tau_exact_1.7) 




TempleGraph 2.4, Origin: /morgoth/users/guo@celeborn - Thu Aug 22 12:13:27 1996 (data file was tf_2) 



